A novel approach to phonetic segmentation through local singularity analysis of speech
نویسندگان
چکیده
This paper presents the application of a radically novel approach, called the Microcanonical Multiscale Formalism (MMF) to speech analysis. MMF provides efficient means for studying local non-linear dynamics of complex signals. The formalism is based on precise estimation of local scaling parameters that describe the inter-scale correlations at each point in the signal domain. In this paper we introduce an efficient way of estimating these local parameters and then, we show that they convey instructive information about local dynamics of the speech signal that can be used for determination of phoneme boundaries. We thus develop a two-stage phonetic segmentation algorithm that uses these parameters to automatically detect the boundaries between phonemes. For the first step, we introduce a new dynamic programming technique to efficiently generate a list of phoneme boundary candidates from a piece-wise linear functional. In the second step, we use hypothesis testing to make the final decision. We present extensive experiments on the full TIMIT database. The results show that our algorithm is significantly more accurate than state-of-the-art ones.
منابع مشابه
Phonetic segmentation of speech signal using local singularity analysis
This paper presents the application of a radically novel approach, called the Microcanonical Multiscale Formalism (MMF) to speech analysis. MMF is based on precise estimation of local scaling parameters that describe the inter-scale correlations at each point in the signal domain and provides efficient means for studying local non-linear dynamics of complex signals. In this paper we introduce a...
متن کاملA novel text-independent phonetic segmentation algorithm based on the microcanonical multiscale formalism
We propose a radically novel approach to analyze speech signals from a statistical physics perspective. Our approach is based on a new framework, the Microcanonical Multiscale Formalism (MMF), which is based on the computation of singularity exponents, defined at each point in the signal domain. The latter allows nonlinear analysis of complex dynamics and, particularly, characterizes the interm...
متن کاملMinimum boundary error training for automatic phonetic segmentation
Annotated speech corpora are indispensable to various areas of speech research. In this paper, we present a novel discriminative training approach for HMM-based automatic phonetic segmentation. The objective of the proposed minimum boundary error (MBE) discriminative training approach is to minimize the expected boundary errors over a set of phonetic alignments represented as a phonetic lattice...
متن کاملمقایسه روش های طیفی برای شناسایی زبان گفتاری
Identifying spoken language automatically is to identify a language from the speech signal. Language identification systems can be divided into two categories, spectral-based methods and phonetic-based methods. In the former, short-time characteristics of speech spectrum are extracted as a multi-dimensional vector. The statistical model of these features is then obtained for each language. The ...
متن کاملUsing Fuzzy Logic and Features Measured from the Time Domain to Achieve Smart Separation of Phonetic Units
The segmentation of uttered speech into phonetic units is a key processing task for successfully implementing speech recognition systems. This paper presents a smart approach to phonetic segmentation of uttered speech that separates vowels from consonants. Time-domain feature-extraction algorithms are applied to speech to extract features at minimum computational cost. Fuzzy decision logic is u...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013